Reinforcement Learning based cooperative longitudinal control for reducing traffic oscillations and improving platoon stability

نویسندگان

چکیده

Stop-and-go traffic poses significant challenges to the efficiency and safety of operations. In this study, a cooperative longitudinal control based on Soft Actor Critic (SAC) Reinforcement Learning (RL) is proposed address issue. The reward function carefully designed consider vehicle cooperation achieve three main objectives: safety, efficiency, oscillation dampening. A global performance metric for dampening evaluate developed RL other baseline models. Depending number preceding vehicles that can share maneuver information, two models RL-1 RL-2 are compared with human driven (HD) an adaptive cruise (ACC) model using HighD simulated data. It found information from additional vehicles, dampen shockwaves more efficiently. Specifically, decrease by 15%-36% 15%-42%, respectively, while HD amplifies 14–37%. ACC also but not as effective RL-2. methods further evaluated data collected commercial Model X vehicle. Compared in some controlled settings, better stop-and-go waves generating smaller growth, overshooting, average acceleration/deceleration rate change, suggesting they generalize well new similar environment. Finally, considering platoon different penetration rates. results show consistently outperform shockwaves.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybrid Cooperative Agents with Online Reinforcement Learning for Traffic Control

This paper presents the application of fuzzy-neuroevolutionary hybrid system with online reinforcement learning for intelligent road traffic management and control. Taking a step away from the conventional traffic control system, the hybrid system presents different methodologies in knowledge acquisition, decisionmaking, learning and goal formulation with the use of a three-layered hierarchical...

متن کامل

Reinforcement Learning For Adaptive Traffic Signal Control

By 2050, two-thirds of the world’s 9.6 billion people will live in urban areas [2]. In many cities, opportunities to expand urban road networks are limited, so existing roads will need to more efficiently accommodate higher volumes of traffic. Consequently, there is a pressing need for technologically viable, low-cost solutions that can work with existing infrastructure to help alleviate increa...

متن کامل

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

Flow: Architecture and Benchmarking for Reinforcement Learning in Traffic Control

Flow is a new computational framework, built to support a key need triggered by the rapid growth of autonomy in ground traffic: controllers for autonomous vehicles in the presence of complex nonlinear dynamics in traffic. Leveraging recent advances in deep Reinforcement Learning (RL), Flow enables the use of RL methods such as policy gradient for traffic control and enables benchmarking the per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transportation Research Part C-emerging Technologies

سال: 2022

ISSN: ['1879-2359', '0968-090X']

DOI: https://doi.org/10.1016/j.trc.2022.103744